CDS

Accession Number TCMCG078C26599
gbkey CDS
Protein Id KAG0497360.1
Location join(35396956..35396966,35398903..35399038,35403599..35403688,35403813..35403917,35404016..35404117,35407562..35407646,35415448..35415514,35415600..35415644,35420933..35422589)
Organism Vanilla planifolia
locus_tag HPP92_002051

Protein

Length 765aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000001.1
Definition hypothetical protein HPP92_002051 [Vanilla planifolia]
Locus_tag HPP92_002051

EGGNOG-MAPPER Annotation

COG_category L
Description breast cancer carboxy-terminal domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03032        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K10728        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03440        [VIEW IN KEGG]
map03440        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTTTTCTAGTGCTAATATTCCATCAGTGGTTGTAGAGGCAATGCACATAGATAGAGGTGGTGCTGCGACTTCGGGCAGCCCGAAGAAGCTTTTCGATGGTGTTCGCTTCGTCCTCTTCGGATTTGACTCCGTTTCGGAGTCTCAGTATCGAAGTGAGCTCATGAATGGTGGGGGAATTGATGTTGGCCGGTATGACTCTAGTTGCACGCATGTGATTGTCTCTGGTCGTGTTTATGACGATCCTGTTTGTGTTTTGGCAAGTAACGATGGGAAAATTCTTGTTACAGAATTGTGGATTGATGACAGTTTGAACTTTGGCATGCTTGCAGATTCAACCAAGGTTTTGTATAGACCAGTTAGAGATTTGAATGGGATTCCTGGTAGCGAGTTAATACACATTTGTTTAACAGGATATCAGAGGCAAGAACGTGATGATATCATGAAAATGGTTTCTTTGATGGGAGCACGTTTTTCAAAGCCTTTGATAGCAAACCAGGTCACCCATCTCATTTGCTACAAGTTTGAAGGTGAGAAGTATGAACTTGCCAAAAGAGTCAATATAAAACTTGTCAATCACCATTGGTTGGAGGATTGCTTGACGACTTGGGAGATCTTGCCAATTGATAAATATACTAAAAGTGCGTGGGAGTTAGAGGTGTTGGAAGCCCAGGCTATGGATTCCGAAGAAGAAAATGATGGGGGTGGCAGAAAGTTTGCAGAAGAGGGAAGTATTGCACAACCTAGCAATTCACGAGGTGCAGTGCCAGTTAAAGTTGCTCCAGGTGTTCTGATGCACGATAACAGAGACATGGGCCTGCTGAATAGTACTGTACCTATAAAGCCACCAAATCTTTCTACCAATAATAAATTGTTTTCTCTTCCATGCGGAGACGGCAGTTCTCAGAAAGCTAATGATTTTTGTAACAGTAATAGCAATTTGCAGGGCAGAACTGAAAATGTTCGAGATGATCATGGCATTGTTGGTGATATCGTTAGCAAACAATCAAGTACCTCAGATTATGTAAATGTAAGCAGGAAATGCATTGGACTTCCCTCTGATGTGAGTATAACTTTGATTAGTACCAGTTCACTGGAAGAGGACAAGAAATTGAGCCTTTCATGTTACAGTAGCAAAGTTCCTGAGAGAGTTGTTTCGCCTGAAGAGAAAATGGAAGAAACAAAAGCAAACATAAATTCTGACATTAGCTCCACAAAATCAAATGCTTCTGCAAAACTGTGCATGTCTGATGATTTGTGTACCCCTTTGAGTGGCACAAACACATCAAATGTGGAAGGCCGAACTTGTTCCTTGCCCCAGAAGCGTAAGCTATCCGTATTGCGTCTAAAGCAGGATCAGATTTCAAGAACGCCTAGCTTGATGGATACCCCGAATATTTTTTCTGCAAAACCAAATAACATGAAGCCTAGAGGCTTTACAGCAGATAACATGCCTTTTCATAATGATGCTAGTGGTAGTGTCTGCAGCGTGATAGAACAGAGTGATGCATTTCCTCAAAAGCAAAAGCAATGTATTTCCAGCAATATTAAAGAATCAGCAGAATTAGCACTGCAAACTTCTAGTACTTGCACCAAACAGAACCCTTCAACAAACATAGCTGTGCCATACAAAGAAACTGATGCAGGTTTTACTCGTAATGGTAATTTTGAAGCGACTACTCCAGGAAATCCAGGAGTAAATCAACAGGATGCTGTGTCGAAACGCAGGTCCTCGATCTACAAGAGGAAGTCCATTAAAGACTATTCTTATTCTTCATTAGTCAAATTTTCCGGTGAGTTCACTCCAGCGTCGGATCACAATGATGAATCATTTAGCAACGGATTTCCACCAAAACCAGTATTAGATATGAAAACTAGTGAAACATCGGGGGAGATGGCTGGAGTTAGTAATATGTATAGTATGAGCTCTCGTGATGGCAAGTTGGTCTCAGATTGTCCAGTGATTAATGAAAGCGGGCAGATTATGAATACAAGTGCAGGAAGACATGCAAACTTTCTGAGATCATGGCCTGCTTCTGAAGTTGCAGATGGAAACGAGGCTTCAAATAGTATGAATTTTCTTCAGCAGGAAAACTTGAGGGAAACTGAAGCTGCTGTAAAGTTTGAGAGTCATAATGTTGTTGCATTGAGATGTAAATTAGATCTTCATTCTGAAACAACTTCTCATCGTGACAGAGCACACGAACACCCGGAGGTTTTACCATCTGATTTGAGTGAGAAAAAGGTGGAAACTTGTGCAGAGATTCCTGGAGAGTGCAAACAGCAAAAAATCAAATGA
Protein:  
MFSSANIPSVVVEAMHIDRGGAATSGSPKKLFDGVRFVLFGFDSVSESQYRSELMNGGGIDVGRYDSSCTHVIVSGRVYDDPVCVLASNDGKILVTELWIDDSLNFGMLADSTKVLYRPVRDLNGIPGSELIHICLTGYQRQERDDIMKMVSLMGARFSKPLIANQVTHLICYKFEGEKYELAKRVNIKLVNHHWLEDCLTTWEILPIDKYTKSAWELEVLEAQAMDSEEENDGGGRKFAEEGSIAQPSNSRGAVPVKVAPGVLMHDNRDMGLLNSTVPIKPPNLSTNNKLFSLPCGDGSSQKANDFCNSNSNLQGRTENVRDDHGIVGDIVSKQSSTSDYVNVSRKCIGLPSDVSITLISTSSLEEDKKLSLSCYSSKVPERVVSPEEKMEETKANINSDISSTKSNASAKLCMSDDLCTPLSGTNTSNVEGRTCSLPQKRKLSVLRLKQDQISRTPSLMDTPNIFSAKPNNMKPRGFTADNMPFHNDASGSVCSVIEQSDAFPQKQKQCISSNIKESAELALQTSSTCTKQNPSTNIAVPYKETDAGFTRNGNFEATTPGNPGVNQQDAVSKRRSSIYKRKSIKDYSYSSLVKFSGEFTPASDHNDESFSNGFPPKPVLDMKTSETSGEMAGVSNMYSMSSRDGKLVSDCPVINESGQIMNTSAGRHANFLRSWPASEVADGNEASNSMNFLQQENLRETEAAVKFESHNVVALRCKLDLHSETTSHRDRAHEHPEVLPSDLSEKKVETCAEIPGECKQQKIK